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Introduction 


A. L. Leiner 


In the seven years since SHAC first began 
regular operation (April 1950), the feasibility of 
applying general-purpose digital computing ma- 
chines to a broad range of new problems has 
become well established. During this period, 
digital computers have been successfully put to 
work on 8 wide variety of scientific and engineering 
calculations, clerical and technical data-processing 
jobs, and real-time control operations. 

For some important applications, however, the 
opportunity of using automatic computers is still 
severely restricted by the limited speed at which 
available digital machines can carry out the basic 
arithmetical operations. For this reason a con- 
siderable development effort is still being directed 
toward the design of faster digital arithmetic 
devices. 

In attacking the general problem of how to 
increase arithmetic computation speeds, two dis- 
tinct paths are open to the computer designer. 
One approach consists in trying to develop elec- 
tronic components and circuitry capable of ampli- 
fying, storing and switching signals at higher 
basic repetition rates. The other approach at- 
tempts to gain over-all speed by finding methods 
for organizing slower electronic elements into 
more efficient over-all systems. In following the 
latter course, the increased problem-solving speed 
is obtained by harnessing the same simple com- 
puting elements together in more complex logical 
combinations. A typical example of the latter 
method, for instance, would be to organize the 
basic computing elements in such a way that 
several separate steps of a given over-all operation 
could be carried out simultaneously by different 
units in the same machine. A straightforward 
application of this principle, however, would often 


entail the use of excessively large amounts of 


additional equipment. As a rule, therefore, 
methods based on the use of duplicated equipment 
must also be coupled with the use of more subtle 
algorithms for carrying out the basic processes and 
the use of processing logics specially adapted to 
the physical peculiarities of the available comput- 
ing components. 

The two papers that follow represent attempts 
to produce faster arithmetic processing units by 
increasing the complexity of their logical structure 
without at the same time materially increasing the 
number of components used. In the first paper, 


which is concerned with addition and subtraction 
operations, the increased speed is obtained chiefly 
by using the early phases of the operation for 
generating certain special auxiliary functions of 
the digits in the numbers being added together. 
These auxiliary functions are then used later on in 
the process to facilitate the determination of the 
conventional “carry” signals that are required in 
the last stages of the operation. The adder 
designs illustrated are particularly suited to the 
special type of dynamic pulse circuitry developed 
at NBS during the course of the SHAC and 
DYSEAC programs. When associated with a 
suitably fast storage device, these adder designs 
yield over-all arithmetic processing units that 
perform up to 150 times faster than the earlier 
machines. With appropriate modifications, the 
same principles could be applied to other types of 
electronic circuitry. 

The second paper is concerned with the more 

general problem of how multiplication speeds can 
be improved by omitting certain time-consuming 
steps that are shown to be not strictly necessary. 
These steps are ordinarily included when the 
elementary definition of multiplication as a simple 
sequence of repeated addition operations is applied 
in too straightforward a fashion. The authors 
show how, by making a simple preliminary inspec- 
tion of one of the numbers to be multiplied, up to 
two-thirds of the usual addition steps can some- 
times be skipped. In this way speed increases of 
of 2.5 to 3 over standard methods can generally 
be obtained. This shortcut multiplication meth- 
od, unlike the previously described addition 
method, is of general applicability and not neces- 
sarily associated with any particular class of 
computing component or circuitry. In terms of 
equipment needs it compares most favorably with 
other proposed methods that involve the use of 
several separate addition devices. 
_ The present Circular contains the first part of a 
group of papers to be published on the logical 
design work carried out at NBS since 1948 on 
digital-computing, data-processing, and control 
systems. The second section, in a forthcoming 
Circular, will describe several digital systems other 
than the SEAC and DYSEAC that have been 
designed for various general-purpose and special- 
purpose applications. 


1. A Logic for High-Speed Addition 
A. Weinberger and J. L. Smith 


1. Introduction 


The development at the National Bureau of 
Standards of the diode capacitor memory [1,2], 
which is capable of being read or written into 
randomly at the rate of one word per microsecond, 
has made it worth while to build devices capable 
of processing information at comparable rates. 
Since the basic microo-peration common to most 
arithmetic processes is the adding of two numbers, 
it seemed desirable to design an adder having a 
cycle time no greater than lyusec. 

The major speed limitation in any adder is in 
the production of carries, and in this paper the 
problem is attacked from the standpoint of logical 
organization. Although work is being done else- 
where on this subject, using newer and faster 
basic circuit elements, the analyses to be described 
show that it is both feasible and economical to 
achieve l-usec addition times for 53-bit words, 
using the 1-Me circuitry that has been success- 
fully utilized in SEAC [3] and DYSEAC (4, 5]. 

The increased complexity of the logic of this 
adder necessitated the extensive use of Boolean 
algebra in arriving at the design itself. Because 
the procedure used in developing the final design 
is an interesting example of the practical applica- 
tion of Boolean algebra, the actual logic of the 
design process is described in considerable detail. 

Before discussing the adder, a brief description 
of the logical capabilities of the SEAC circuitry [6] 
will be presented. As shown in figure 1.1, the 
basic electronic unit consists essentially of three 
levels of diode gates in an OR-AND-OR logical 
array followed by a transformer-coupled pulse 
amplifier. The rate at which successive pulses 
pass through such a stage is determined by the 
clock frequency, which is, in this case, 1 Me/sec. 
The transit time of a pulse through a stage, how- 
ever, is much less than 1 usec. For this reason, 
the clock pulses are made available in several 
phases. The way in which different stages are 
controlled by clock pulses of different phases is 
illustrated in figure 1.2. In SEAC, for example, 
1-Mec clock pulses are available in 3 _ phases, 
¥% usec apart. In DYSEAC, 4-phase clock pulses 
are used, whereas for reasons that will be developed 
later, in the adder to be described a 5-phase clock 
isused. Figure 1.3 shows graphically these timing 
relationships for SEAC. Signals emitted from 
different stages clocked at different times must be 
synchronized by means of electric delay lines 
before they are gated in a common stage, as shown 


1 Figures in brackets indicate tho literature references on page 12, 


in figure 1.4. Both positive and negative signals 
are available from a stage, the negative signals 
being used for inhibiting (see fig. 1.5). 

The logical gating required in any stage of the 
adder to be described is essentially of the same 
complexity as that required in the packaged 
building blocks used in constructing DYSEAC, 
and in the OR-AND-OR gating configuration of 
a stage up to 4 AND-gates and up to 6 inputs in 
the largest AND-gate are permitted: 

Boolean notation of the sort described by 
Richards [7] will be used hereafter to describe the 
gating configurations. In figure 1.6 are shown a 
typical gating stage and the corresponding Boolean 
expression for the output in terms of the inputs. 
There are three terms in the expression, each one 
corresponding to an AND-gate; the first term, 


(A+ B)CDEF, corresponds to the top AND-gate; 


_ the second term, (@-++A) J, corresponds to the mid- 


dle AND-gate; and the last term, J/(K+2+M)N, 
corresponds to the bottom AND-gate. The fac- 
tors of a term represent the inputs to the corre- 
sponding AND-gate. For example, the five 


factors of the first term, (A+B), C, D, E, and F, 
correspond to the five inputs to the top AN D-gate 
Whenever a factor consists of more than one term, 
it is represented by an OR-gate. For example, the 
factor (A+B) of the first term corresponds to the 
2-input OR-gate of the top AND-gate. <A factor 
could also be a negative or inhibit signal, and in 
this case it is denoted by a bar on top; e. g., Cand 
D are two factors of the first term corresponding 
to the two negative signals, which may inhibit 
the top AND-gate. For the sake of simplicity in 
the discussion of the Boolean expressions that 
follow, no distinction is made between delayed 
and undelayed signals. 
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Figure 1.1. One stage of SEAC-type circuitry. 
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Figure 1.6. Typical gating stage and corresponding 
Boolean expression. 


2. Sequential Carry Generation 


The analysis leading to the design of the parallel = pressed in Boolean notation as follows: 
adder will now be described in detail. a oe _ 
Let Sy= AyByCy_1+ An BiCe_-1+ An BiCy-1 eae 
A=augend=A,2*"!+A, ,2*?+ ...4 A,2°, TaBuE 1.1. Function table for binary addition 


B=addend= B,2*"'4+ B,:2*?+ . . . +B,2°, ie Ne 
Addend_-__-| B, 
S=sum =S,2*-!1+8,_,2%?4+ . +8,2°, 
Previous 
C,=the carry resulting from the addition in 

the kth digit position. 
_The well-known rules for binary addition are 
given in the form of a function table (table 1.1). 
From these, the binary sum and carry can be ex- 


OS ABO PAB Cre PAC. 4 ABCs 
—A,B,+ACr1+BiCr-1 
= (A;+ B;) (Agt Cr_-1) (Bit Cr-1) (2) 
= A,B,+(A,+B,) Cr. 


The carry function, C,, has been reduced from 4 
terms of 3 factors each (corresponding to 4 AND- 
gates with 3 inputs each), as shown in the top line 
of eq (2), to 3 alternative forms, each involving 
fewer terms and factors. 

Since the expression for S;, in eq (1) can be 
implemented in one gating stage, any sum digit 
can be made available during the clock phase 
immediately following the formation of its cor- 


responding carry, C,_,. However, if the carries 
are generated according to eq (2), each carry digit 
would have to await the formation of the next 
lower-order carry. As a result, the sum digits 
could be obtained at the rate of only one per clock 
phase, for if C, is available during the first clock 
phase, C, could be generated during the second 
clock phase, C; during the third clock phase, ete. 
For numbers having n binary digits, n—1 possible 
carries would have to be provided for, requiring 

n—1 clock phases for their complete determina- 
tion. If a 4-phase, 1-Mc clock were used, 4 suc- 
cessive sum digits could be obtained during 1 usec. 
Such an arrangement, using sequential carry 
generation, would provide an increase in speed of 
a factor of only four over the addition speed of a 
completely serial adder. 


3. Simultaneous Carry Generation 


The limitation on the sequential method of 
forming the carries stems from the use of eq (2), 
which specify C;, as an explicit function of C;_;. 
It can be shown that a carry need not depend 
explicitly on the preceding one, but can be ex- 
pressed as a function of only the relevant augend 
and addend digits and some lower-order carry. <A 


considerable gain in speed may be obtained as a 
result of this. 

Using the functional form given by the last 
equation in (2), successive carries are shown to be 
expressible in terms of the same lower-order carry 
by a method of substitution. 


C= A,B, 
+ (Ai+ Bi) Co 
C= A,B» — A,B, 
+ (A+ Bo) CQ + (Ap +B2) AB, 
+ (Ag+ By) (Ai + Bi) Co 
C= A;B; —> A;B3 
+ (Ag+ Bs) C2 + (Ag+ B3) A, Bo 
+ (As+ Bs) (A+ By) AB, 
+ (Ag+ Bs) (Az+ Be) (Ai+ Bi) Co (3) 
C= A,B, = A,B, 
=F (A,+ By) C; cs (A,+ By) AsB; 
+ (Ag+ Bi) (Ag+ Bs) Aa Be 


+ (Au+ By) (A3+. Bs) (Ao+ Bo) AiBy 
+ (Ag+ By) (A3+.B3) (A+ Be) (Ait By) Co 


oar A,B, 


+ (Ag+ By) A; B; 
+ (Ag+ By) (Ag+ Bs) AaB; 
+ (Ag+ Ba) (A3+-Bs) (Ag+ Be) (Ai+ By) (Ar+ Co) (B,+Q)). 


Equations (3) show how as many as 4 successive 
carries can be expressed as functions of the same 
carry, with all expressions consisting of no more 
than 4 terms and with the largest term consisting 
of no more than 6 factors. These 4 carries can 
therefore be generated simultaneously by means 
of only 4 gating stages. 

Similarly, the next more significant four carries, 
C; through Cs, can be formed simultaneously dur- 
ing the next clock phase as functions of the appro- 
priate augend and addend digits and C,. In short, 
four successive carry digits can be formed simul- 
taneously every clock phase. One gating stage 
per carry is required. 

To summarize, if Cy is available in the first clock 
phase, C; through C, can be generated during the 
second clock phase, C; through C; during the third 
clock phase, etc. Each group of sum digits can 


be obtained one clock phase after the corre- 


sponding group of carries has beenformed. Figure 
1.7 illustrates in block-diagram form an adder 
utilizing this principle of simultaneous carry 
generation. | 7 7 


Ist. 987654321 
CLOCK IAJA|A|A| {AUGEND 
PHASE ADDEND 


2hd. 
CLOCK 
PHASE 


CARRY 

GENERATION 

3rd. 
CLOCK 
PHASE 


4th. 
CLOCK 


PHASE SUM 


Figure 1.7. Nine-bit parallel binary adder. 


4. Use of Auxiliary Carry Functions 


Of signal importance is the use that can be 
made of the second clock phase to further speed 
up the addition process. ‘This time can be utilized 
to form certain aunliary carry functions, which 
enable additional carries to be generated during 
the third clock phase simultaneously with the 
carries C; through C3. More specifically, Cy, Cio, 
etc., can be formed during the third clock phase as 
functions of C, if some of the terms involving only 
the augend and addend digits in the expanded 
relations for Cy, Cio, etc., are combined as auxiliary 
carry functions in separate stages during the 
intervening clock phase. 

For example, the expression for Cy is shown in 
the first equation in (4) expanded as a function of 
C. Because of limitations on the gating com- 
plexity, it is not possible to form C, directly even 
if it were reduced to four terms. Instead, the 
function is implemented by parts. 


Cy=|AgBy 
+|(Ag+By) AgBs 
T| (Ag+ Bg) (Ag+ Bs) A,B, 
+} (A>+ Bo) (Ag+ Bg) (A7+ By) AgBe 
(Ag+ Be)(As+ Bs)(Ar+Bz) (Ag+ Bs) AsBs 


(4) 


al inlsbichtiteessibiaiat si biiuiditien Cs 


C= A» 


+1 Yo IC, 


(The outlines drawn around the various parts of 
eq (4) serve merely to correlate the corresponding 
parts of the two equations.) The 5 terms en- 
closed within the triangle can be reduced to 4 
terms by combining the first 2 terms. This re- 
duced 4-term expression can then be implemented 
in 1 gating stage during the second clock phase, 
and it is then designated by X5. The single factor 
enclosed within the rectangle can also be im- 
plemented during the second clock phase in one 
gating stage. It is designated by Y»). By means 
of these 2 auxiliary carry functions, X, and Yo, 
the actual carry C, can be formed quite handily 
in 1 gating stage during the third clock phase, 
according to the second equation in (4). 

The next 4 carries, Cio through C3, can also be 
formed during the third clock phase by utilizing 
these same auxiliary carry functions. The most 
complicated of these expressions, the one for C43, 
is given in eq (5), where further combinations are 
made to reduce the number of terms to four. 


C3=Aj2Byi3 

+ (Aist+ Bis) Air2Bi 

+ (Ajs+ Bis) (Apt By) An Bu 

+ (Ay3+ Bis) (Are + Biz) (An + Bu) AwBro 

+ (Ag+ Bis) (Ay2+ Biz) (Au + Bu) (Aro+ Bio) Xo 

+ (Aj3+ Bis) (Ait Biz) (Aut Bu) (A+ Bio) YoCs 

(5) 
C13= (Ajst Bis) (Ais+ Ais) (Ais + Biz) (Bis + Ais) 
(By3+ By,) 
+ (Aist+ Bis) (Apt By) AnBu 
+ (Ag+ Bis) (Ar + By) (Au + Bu) AwPyo 


a (Ay3+ By) (Ayo+ Bis) (An + By) (Ajo+ Bro) 
(Xo+ Yo) (Xo+ Cy). 


Figure 1.8 illustrates a parallel adder that will 
complete an addition on 14 binary digits in 4 
clock phases, using 1 pair of auxiliary carry 
functions. 

By means of additional auxiliary carry functions 
it is possible to extend still further the sequence 
of carries that can be formed in the same clock 
phase. For example, as shown in eq (6), Ci, can 
be expressed as a simple function involving C, 
and another pair of auxiliary carry functions, Xy, 
and Yi4, which are defined implicitly in eq (6). 
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+|(Ay+ By) Aj3Bi; 
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Figure 1.8. Fourteen-bit parallel binary adder. 
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O13, Cg, and Ci, can also be implemented in single 
stages as functions of C, by using the same two 
pairs of auxiliary carry functions. Cis, Cy, and 
Coo require still a third pair of auxiliary carry 
functions in order that they be generated during 
the same clock phase as functions of C4. 

If it were desired, a total of 25 carries could be 
generated simultaneously as functions of C, dur- 
ing the third clock phase without exceeding the 
limitations on gating complexity. However, if 
the number of simultaneous carries is limited to 
16, only 3 pairs of auxiliary carry functions are 
required. Figure 1.9 illustrates a parallel adder 


that can add numbers of 21 binary digits in 4. 


clock phases, utilizing this scheme. 


444675—58———_-2 


4th. 
. CLOCK 


21 20 19 18 I7 16 15 14 13 12 11 10 9 


(6) 
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Fiaure 1.9. Twenty-one-bit parallel binary adder. 


5. Two Levels of Auxiliary Carry Functions 


To extend the parallel adder to accommodate 
53 binary digits, it will be shown that only 1 addi- 
tional clock phase is necessary, and that during 
the fourth clock phase the carries C,, through Cs, 
can all be generated as functions of Cy. The en- 
tire parallel array of sum digits, S, through Sj, 
can then be formed during the fifth clock phase. 

The ability to generate all of the carries Cy 
through C;, during the fourth clock phase stems 
from the fact that two clock phases are available 
between these carries and the input digits. This 
permits the formation of two levels of auxiliary 
functions. ‘The first level consists of sets of X’s 
and Y’s, which are functions of the relevant 


Cop =n AzB 32 
= (Az. 23 Bs) As Bay 


augend and addend digits only, as was the case 
previously. The second-level auxiliary carry func- 
tions are generated by sets of stages labeled Z and 
oe are functions of certain first-level functions 
only 
Figure 1.10 illustrates in block-diagram form 
the complete 53-bit adder, which makes use of 
first-level and second-level auxiliary carry stages. 
As in the case of the preceding carries, C2, through 
Cy, are generated as functions of the appropriate 
augend and addend digits, some of the first-level 
auxiliary carry stages, and Cy. For example, the 
most complicated of these, C2, 18 Shown in eq (7) 


— to be reducible to four terms. 


+ (Age + Bap) (Asi + Boi) Ago Boo 

+ (Ase + Bas) (Asi + Bar) (Ago + Bao) Xog 

+ (Ase+ Bas) (Asi + Bar) (Azo+ Bao) Yoo X25 (7) 
+ (Age+ Boe) (Asi + Bai) (Ag0+ Bao) Y29 Vo5Co0 


C32 (Aze+ Bao) (Aga t+-Asi) (Age+ Bar) (Bs2+ Asi) (Bae + Bar) 
+ (Age + Bye) (Asi + Bai) Aso Boo 
+ (Age + Bo) (Asi t+ Ba1) (Ago + Boo) (X29+ Y29) (X29-+ X25) 
+ (Ags + Bos) (Asi +-Ba1) (Ago-+ Bao) Y29 YasCoo. 


53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14:13 12 W109 8 7 6 § 4 3 2 I 


Ist. 
CLOCK 


_ | AUGEND 


PHASE 


2nd. 
CLOCK 
PHASE 


pt | Hill 


ill 


3rd. 
CLOCK 
PHASE 


4th. 
CLOCK 
PHASE 


5th 
CLOCK Is] s| 
PHASE 


[sis{s|s/sisis{s}s/sisisisisis|s]s}s|s/sjs/s|s/s| 


ee ere 
[e{e ie |8 |e [8/8 (8/5 (8 |B |B (a | 8/8 1B 18 (B 8/8/88 /8|8\/B8/BlB\B8 |B. Ab ¢ co | 


lie are 


CIT MA A 


Ici cic| Cl] C| cl ic icl cl 


eae 


AODEND 


CARRY 
GENERATION 


etree tr ter Caen can 


fia tannin 


lc [cle |clci|e[clcjele|clejc| 


ao; oa oa oo; wo oO; ao; a) a 
dj dj di di ai ai ai aia dt) q| < 


is[s|s[s/s[s[s]s[s/s]s|s/s[s]s]s]s]s|s{s]s|s] 


HL | | I li Lil ai 


Figure 1.10. Féfty-three-bit parallel binary adder. 


The next higher-order carry, C33, requires a third 
pair of auxiliary carry functions, X33 and Y33, as 
shown in eq (8). Also, at this point it becomes 
economical to form a pair of second-level auxiliary 
carry functions, Z33, consisting of the terms within 
the solid-line triangle, and W433, consisting of the 
factors within the solid-line rectangle. C33 can 
then be simply generated by means of Z3 and W,3;, 
as shown in the last of eq (8). 


Csg = A33.B33 
Ss (A33 a B 33) Agb 32 
a3 (As3s zi B33) (Az. a B32) A; Bay 


aa (Ass ae Bss) (Asp ae Bs) (As; ae Bs) AzoB3o 


ee ee ee ee we ew ee ee we wee ee ew ww wee ewe 


=F (A33-+ Bs) (Ag2-+ Bye) (Ag, + B31) (Aso+ Bao) Y og X98 


ar (Ass ae B33) (Age aa Bz2) (As, as B33) (A450 si B30) 3 Yo9 Y 95 


Cop 


+|Yast Yo0Y2s 


C33 233 


++ ]W33| C20 


The subsequent carries, C4, C35, etc., are simi- 
larly generated by means of these and, when neces- 
sary, other second-level auxiliary carry functions. 
For example, for the carries up to C3, the same 
pair of second-level functions, W3; and Z33, is 
sufficient, whereas C3, requires the use of another 
pair of first-level and second-level auxiliary carry 
functions in a manner exactly analogous to the 
formation of C43. 


Cry 2 


<2 


The last digit position where auxiliary carry 
functions are introduced is at 48. The carry at 
this position, Cg, is shown in eq (9) to be a simple 
function of the last pair of second-level auxiliary 
carry functions. 


Cyg= | Xa 
+] VasXas 
+ | Vag Y43.X38 
++] Vag Vag V3g-X33 
+1 Vg V3 Vag Y 33X29 
+] Vag V43 Vag Y 33 Y 29X25 


4+1Y sg 43 38Y ag¥ 20¥ 25 Cop (9) 


ae Vag Yas Y 38 Y 33 Y 29 Y 25 


Cro 


Cg= Lag 


= Wel 


The collected Boolean expressions for the 
auxiliary carry functions and for the carry func- 
tions themselves for this particular 53-bit adder 
are given in tables 1.2 and 1.3. 

The number of gating stages required to imple- 
ment this design can be seen by examining figure 
1.10. Each square box in the diagram represents 
one gating stage. Of the 238 stages used in the 
whole adder register, note that only 26 are used 
to create the auxiliary carry functions. The 
other 212 stages are needed irrespective of how 


10 


the carry digits are formed, because they comprise 
53 sets of 4 stages for the augend, addend, carry, 
and sum digits. 


TABLE 1.2. Auxiliary carry functions 


F, represents (Az +Br)(Azt+Ai-1) (Art Ba-1) (BetAs-1)(BrtBu-1). 
Dx represents ArB,y. Rx represents (Ax.+Bx). 


Xo =Fyt+ RyoRgD7+ Rohgh; De 


Ryligh 7 hgDs 
Xw = Fyt+ RyRigDe4t RuRighy.Diy 
+ Ry high yh; Dio 
1g = Dig+ RigDizt RighizD 16 
+ Righty, yeDis 
xX 2 PF 25 ie Ro, Ro Dog a Ro5 Fog Rog Deo 
+ Ro5fo4 Ro3fo2Do1 
X99 = Dog+ Rog Dog+ Rog fogDo7 
29fvoglo7 Dog 
X 33 = D33-+ Rs3D32+ Reght32D1 
+ Rg hgoh31 D0 
X 3g == F'3g-+ Raghs7 Dag + Roglts7 Fez D 3s 
+ Reglts7fsglt35De4 
X43 = F4g+ Rag hyoDa + Rag ha. ha, Dao 
+ Rag hag Ra f4oD39 
X 4g = Pag-+ Raghtez Dag + Ragha7 Rag Das 
+ Ragha7 Rag Ras Das 


Z33 == Xagt+ Y33Xo9-+ Y33 Yoo X05 
Z3g == X3g+ Y3gX33-+ Vg Ya3X29+ Vag V33 Yoo X25 


Z43 a X 43+ Ya3X3g-+ Y43 YgX 35 
+ Y43 Vg Y33(Xe9-+ Yoo) (Xe9+ X25) 


Zag = Xagt VagX a3 
+ Vag Ya3(Xag-+ Vag) (Xag+ X33) 
+ Vag Ya3 Vag Ya3(X29-+ Yoo) (X29-+ X25) 

Yo =RoRgh7RoRs 

Yu =RyRypR phy 

Yig = RighyRyRis 

Yos = Rog RoyRoghooho1 

Yo = Rog Rogho7kog 

Y33 = Rg h39h31F39 

Vag = Raghg;Raghss hag 

Vg =RyRahn Rakes 

Vag = RagRayRaghasRag 

W33= V3 Yoo Yes 

W3g== Ys V33 Y29 Y 25 

W13= Ya3 Y3g Y33 Y29 Y 25 

Wag= Vag Y43 Y3g 33 Y29 Y25 


TaBLE 1.3. Carry functions 


Fy represents (Ak+ Bx) (ArtAr-1) (Art Bu-1) (BirtAx-t) (Bit Bi-1). Di represents AxrBy,. Ry represents (Ax+Bx). 


C, =D,+ RCo 

C, = Do+ R2D,4+ R2hyCo 

C; = D3+ R3D24+ R3R2D,+ hgh. Co 

Cy = Dit Ry D3+ RaR3D.+ RyRghohy(Ay+ Co) (By+ Co) 

Cs =Ds+ RsCg 

Ce ae D6 a ReDs+ RehsCs 

C, = D+ BR; Det Ry ReDs+ Ry hghsCy 

Cy = Fst Rah, Det Reh; Re Dst+ Reh Re RsC, 

Cy =Xot Vos 

Cyo= Dio + Ryo(Xo+ Yo) (Xo+ C1) 

Cy=Dyit Ry Dio+ Rukyo(Xot+ Yo) (Xo+ C4) 

Cye=Dypt RpDi+ kph Dit Rikhihio(Xot+ Ye) (Xo+ Cy) 

Cy=F yt RyhpDut Rishi Dit Rishi Riyo(X9+ Yo) (Xo+ Cy) 
Cu=Xut YuXot YuYoCs 

Cys= Dist RisX14t Bys Vis Xo+ Yo) (Xo+ Ci) 

Cro= Digt Rye Dis+ RighysX14+ Rig Ris Via(Xo+ Yo) (Xo+ Ci) 

Cy=Fyrt+ Ri Dist RivhighisX 4+ By highs Via Xot+ Yo) (Xo+ Cy) 
Cis=Xigt VYisXut YisYuXot VYisYiuYoC, 

Cio= Dig t+ Rig( Xig+ Vis) (Xist+ X14) + Rig Vis Vis(Xot+ Yo) (Xo+ Ce) 
Coy = Dro + R2oD19+ Roolig(X1s-+ Yrs) (Xis+ X14) + Rook Vis Yis(Xo+ Yo) (Xo+ C) 
Cy = Da + RoC 

Co2= Dog + Ro Do + Rog ho1Co 

Co3= Dog + B23 Do2-+ Bo3ho2Do1 + Rog Rho; Co 

Cog = Peat Rog Fo3 Dog + Rog Fog Ro2 Doi + Figg Rog R22 Ro Coo 

Cy5= Xop+ Yo5Co9 

Cog= Dog + RogX 25+ fg Yo5C29 

Coy = Dez + Roy Dog + Roz hgX 25+ Roz Rog Yo5 Cro 

Cog= Pog + Rog ho, Doe + Rag haz og X 25+ Rog hoz R26 Yo5C x0 

Cog= Xog+ Yoo X05-+ Yo9 Y25C20 

Cy9== D3o+ Rzo(Xe9+ Y 29) (X29 X95) + Rego Y29 Yo5C 29 

C31= Dai + Rai Dg0-+ si fg0(X29-+ Yoo) (X29+ X25) + Bai Bao Y29 YosC 20 
Cyo= P32 + Rakai Dgo + Bgol31A39(X 29+ Yo9) (X29+ X25) + Rao h31 30 Yoo YosCoo 
C33= Z33-+ WeagCro 

Cz4= Dos + R34(Z33-+ Wes) (Za3-+ Coo) 

C35== D35+ R35D34+ Rg5Rge(Z33 a Was) (Z33-+ C9) 

Csg= Dog + R3¢Do5-+ Rye lssDo4+ Rag lg5f34(Z33-+ Wo) (Za3-+ Coo) 

C37= F'37-+ B37 Fe36.D35 + R37 36 h35D34-+ Rez Pegg lg5lg4(Z33 + Ws) (Z33-+ Co9) 
Cyg= Zag-+ WagC ro 

C39 = Dgg+ Rzg(Zag-+ Wes) (Z38-+ Coo) 

Cyo= Dao+ Rap Do9-+ Rao lz9(Z3g+ Ws) (238+ C20) 

Cy = Dy t+ Ra Dio+ Rai Rap Do9+ Rar Baohs9(Z3g-+ Weg) (Zag+ Cro) 

Cry= Fyg+ Rag ha Dig + Rag Pai Rip D39+ Rag Far Rao P39(Z3g-+ Wg) (23g + Cro) 
Cy= Zag+ WasC 20 

Cu= Dat Ras(Zeg+ Wag) (Za3+ Cro) 

C1s= Das Rap Daa+ Rag Pega (Za3+ Was) (Za3-+ Cr) 

Cyg= Dag t Rag Dagt+ Ragas Dar Rag hss Raa(Za3 + Wa) (243+ Cro) 

C= Fay t+ Ray Rag Dig + Raz Raghss Das + Raz Rags Rag (Za3+ Was) (Z43-+ Co0) 
Cig= Zag+ WasgCro 

Cig= Dag + Rag(Zag-+ Wag) (Z4g+ Cro) 

Cro= Dso+ RsoDag+ Ryo hvag (Zag at Wz) (Zag af C29) 

Cs1= Dot + B51 Ds0+ Bsikes0Dag + Boi RoR (Zag+ Was) (Zag + Coo) 

Cro= Psa + Reels: Ds0+ Rooks R59 Dag + Roos: Reoltsg(Zag-+ Wag) (Zag+ Co) 


ie 


As indicated previously, five clock phases are 
occupied, starting with the input digits and ending 
with the sum digits. As the adder is to be used 
for multiplications and divisions in a repetitive 
fashion requiring the recirculation of the sum 
digits back into one of the inputs with appropriate 
shifts, the clock pulses must occur in five phases 
to allow an addition cycle to be completed in 1 
psec. 

The top line of table 1.4 gives some statistics 
on the number of components required for the 
adder represented in figure 1.10. Two other 
slightly different versions have been worked out 
in which fewer gates need to be driven by the 
most heavily loaded tube. As the table shows, 
these variations also require different proportions 


of components. Approximately 10,000 germanium 
diodes are required in each of these versions. 


TaBie 1.4. Number of components required 


Maxi- 
mum 
load ! 


Number 
of stages 


Delay 
Tubes lines 


1 Unit of load=one gate-load. 
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2. Shortcut Multiplication for Binary Digital Computers 


J. L. Smith and A. Weinberger 


1. Introduction 


The usual method of accomplishing automatic 
multiplication in binary digital computers con- 
sists of repeated additions of the multiplicand (the 
partial product) to the partial-product sum under 
the control of successive single digits of the multi- 
plier beginning at the least-significant end. A one- 
position shift accompanies each addition cycle, 
either the partial-product sum to the right, or the 
multiplicand to the left. The process is somewhat 
analogous to the pencil-and-paper method of 
performing decimal multiplication, except that 
instead of storing all the partial products sepa- 
rately and adding them together as the final step, 
a running sum of the partial products is maintained 
in an accumulator. This method requires as many 
add-and-shift cycles as there are multiplier digits. 
The accumulator is cleared initially. During the 
first addition cycle, if the least-significant digit 
of the multiplier is a 1, the multiplicand is added 
to the contents of the accumulator, whereas if it 
is a 0, nothing is added. In either case the 
contents of the accumulator and of the multiplier 
register are shifted one position to the right. Simi- 
larly, during the second addition cycle, the next 
more significant digit of the multiplier determines 


whether the multiplicand is or is not added to the 
contents of the accumulator. Again, in either 
case, the contents of the accumulator and of the 
multiplier register are shifted one position to the 
right. After each multiplier digit in turn has 
controlled the addition of the multiplicand to the 
contents of the accumulator (with the one-position 
shift for each cycle), the multiplication operatior 
is complete, and the final product is available in 
the accumulator. 

To complete the description of the multiplica- 
tion process, it should be pointed out that the 
length of the final product (assuming full sig- 
nificance) is generally equal to the sum of the 
lengths of the multiplicand and multiplier. The 
accumulator must therefore be of that length if it 
is to store the complete product. Actually, the 
product can be accumulated partly in the accumu- 
lator (major product) and partly in the multiplier 
register (minor product). Each digit of the minor 
product (the least significant one in the accumu- 
lator) obtained during any addition cycle is stored 
in the extreme left-hand position of the multiplier 
register, which is vacated during each shift of the 
multiplier-register contents. 


2. Basis for Shortcut Multiplication 


Whenever any multiplier digit is a 0 and nothing 
is to be added to the partial-product sum, it is 
obvious that that addition cycle could be omitted 
entirely, and that only a shift need be performed. 
A sequence of zeros would allow omitting as many 
addition cycles as there were zeros. The idea will 
now be developed that if a sequence of 1’s occurs 
in the multiplier, cycles corresponding to these 
1’s may also be omitted in much the same way, if 
subtraction of the multiplicand from the partial- 
product sum is provided for. 

Consider a string of n consecutive 1’s in the 
multiplier. The placement of the binary point is 
arbitrary so far as this argument is concerned, and, 
for the sake of simplicity, let it be located to the 
right of the right-hand 1. Then the numerical 
value of just this string of n 1’s is 


Qr1pon2tonat | 1914 90, 


It is quite easily shown that this expression is 
equal to 


20. 2°. 


Therefore, in computing the partial products 
corresponding to this string of n 1’s, one subtrac- 
tion of the multiplicand for the right-hand 1 
and one addition of the multiplicand for the 0 to 
the left of the string are sufficient. Such a se- 
quence of 1’s begun by a subtract cycle is referred 
to hereafter as a subtract sequence. 

The fact that it is possible to compute a product 
correctly by omitting cycles that correspond to 
strings of 0’s and to strings of 1’s is the basis for 
this method of shortcut multiplication. 

During the multiplication process, it is a funda- 
mental requirement that the contents of the 
accumulator (and of the multiplier register) be 
shifted one digit position to the right for every 
digit of the multiplier, regardless of whether or not 
any of the cycles are omitted. Therefore, in 
order that the multiplication time be shortened 
to the fullest extent by virtue of omitting cycles of 
the operation, arrangements should be made for 
accomplishing multiple-position shifts as rapidly 
as possible. Ideally, the shifts that correspond 
to cycles to be omitted should be combined with 
the shift that corresponds to the last cycle actually 
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performed. Thus, for example, if 3 cycles are to figure 2.1 where shifts of 1, 2, 3, and 4 positions 
be omitted, 1 shift of 4 digit positions should be are provided for (corresponding to the omission 
executed instead of 4 shifts of 1 digit position each. of 0, 1, 2, and 3 cycles). First, shift paths of the 

From considerations of the logical gating re- same lengths as are provided for the accumulator 
quired to accomplish any shift, it is evident that . must also be provided for the multiplier register 
the number of different shift paths must be limited in order to have the proper multipher digits at 
to only a few. But it will become evident that a hand for controlling the next cycle. Second, 
point of diminishing return exists such that there provision must be made for examining a number of 
is little need for exceeding the provision to omit succeeding multiplier digits at once in order to 
either 1, 2, or 3 cycles at any one time. determine how many of the succeeding cycles are 

There are two important features that must be to be omitted, and whether the multiplicand 
incorporated in a device using a multiplication should be added or subtracted (or neither) during 
scheme such as this. These are illustrated in the current cycle. 


INFORMATION 
SIGNALS 


CONTROL 
SIGNALS 


CARRY GENERATION 


eee Ree FOUR LEAST nin 
, ——— SIGNIFICANT 


ADDER— 


ACCUMULATOR peels 
ay, ae, RIGHT SHIFT J 
| RIGHT SHIFT 2 
; Fe : RIGHT SHIFT 3 | | : 
wy “——~ RIGHT SHIFT 4 ADD-SUBTRACT e— | 
R , IMULTIPLICAND fey | |] 
| rE 
SIGN DIGIT PEC Et OF 
a — Ht Wir 
rele aicar LS 
TIMER tt 2 SHIFT 26° 
| epee | BOE TERMINER ar 
ee RIGHT SHIFT 
= Rist SuIET 3 
ard } RIGHT SHIFT | 
| I NO SHIFT 
IPLIER ES& FOUR LEAST = 
LIE ____ === [SIGNIFICANT DIGITS 


SIGNIFICANT 
DIGITS 


Figure 2.1. Schematic diagram for shortcut multiplier. 


3. Rules for Governing Shortcut Multiplication 


The previous paragraphs have outlined the basis then its numerical value (assuming the sequence 
for speeding up the multiplication process by to be an integer) is 
omitting cycles of the operation. The means for 


governing such a procedure, in order to be mechan- 274-28 2°-+-2?-+- 21-29, 
ized, must be reduced to a set of well-defined rules. 

_ Before the rules are set down in detail, an ad- which can also be expressed as 
ditional fact relating to a subtract sequence will 98__94__92__90 


be explained. For example, if the sequence is 


This means that if a sequence of 1’s in the multi- 
11101011, plier is begun by subtracting for the right-hand 1, 
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cycles for the rest of the 1’s can be omitted, but 
cycles for the 0’s become subtractions. Note that 
in the example the two 0’s correspond in value to 
2? and 2*, and that these appear negatively in the 
last expression. 
In the succeeding discussions, it is assumed that 
any number of cycles from 0 to n may be omitted 
at any one time, 1. e., that the accumulator and 
multiplier registers can shift any number of posi- 
tions from 1 ton+1. For the sake of brevity in 
the discussions to follow, an omitted or nonomitted 
cycle corresponding to any multiplier digit will be 
referred to simply as an omitted or nonomitted 
multiplier digit. The action proceeds from right 
to left through the sequence of multiplier digits. 

Developing the rules for governing this method 
of shortcut multiplication falls naturally into two 
parts. First are the rules for determining whether 
to add or subtract the multiplicand or to do 
neither. This latter case, which constitutes a 
trivial cycle, is required only because the number 
of different shift paths, and hence the maximum 
amount of shift, is restricted for reasons of equip- 
ment economy; it is not always possible to omit as 
many cycles as could be omitted if there were no 
restriction on the maximum amount of shift. 
Second are the rules that determine how many 
digit positions the accumulator and multiplier 
registers are to be shifted. As stated before, the 
number of positions ranges from 1 to n-+1. 

Both sets of rules depend on (a) the current 
right-most digits of the multiplier after the shift 
for the previous cycle has been accomplished, and 
(b) the sign of the partial-product sum, 1. e., 
whether the last nontrivial cycle was an add cycle 
or a subtract cycle. 


3.1. Rules for Adding or Subtracting the 
Multiplicand 


Case I. Partial-Product Sum Not Negative (Add 
Sequence): 


If the partial-product sum is not negative, as it 
is in the initial state or as the result of adding for 
the last nontrivial cycle, the multiplicand is to be 
added if the current right-most digits of the 
multiplier are XX01. (Unspecified digits are 
immaterial.) It is not advantageous to begin a 
subtract sequence for a single 1 because an addi- 
tion would be required during the very next cycle 
for the indicated 0. But for two or more 1’s it is 
advantageous, so that if the current right-most 
digits are X X11, the multiplicand is to be sub- 
tracted from the partial-product sum. 

If during an add sequence the current right- 
most multiplier digits are X X-X0 (either an initial 
condition or a condition that might arise because 
a shift of more than n+1 digit positions is not 
permitted), the multiplicand is to be neither 
added nor subtracted, and the current cycle is a 
trivial one. 


Case II. Partial-Product Sum Negatiwe (Subtract 
Sequence): 


If the partial-product sum is negative, as the 
result of subtracting for the last nontrivial cycle, 
the multiplicand is to be subtracted if the current 
right-most digits of the multiplier are X X10. 
In this instance, the 0 is treated like those in the 
subtract sequence described earlier in this section. 
It turns out to be advantageous to maintain a 
subtract sequence for isolated 0’s. But for two 
or more consecutive 0’s it is advantageous to 
terminate the subtract sequence, so that if the 
current right-most digits are X X00, the multi- 
plicand is to be added to the partial-product sum. 

If during a subtract sequence the current right- 
most multiplier digits are XX X1, a condition that 
might arise because a shift of more than n-+-1 digit 
positions is not permitted, the current cycle is a 
trivial one, and the multiplicand is to be neither 
added nor subtracted. 

In table 2.1 are given practical add-subtract 
rules in a form that can easily be mechanized. 


TABLE 2.1. 


Rules for adding or subtracting 


Action Required condi- | 


tions ! 
Add multiplicand_....._-___-- 01/+ or 00/— 
Subtract multiplicand_._._____ 11/+ or 10/- 


! The digits shown are the current right-most digits of the multiplier. The 
sign indicates whether the partial-product sum for the last nontrivial cycle is 
positive or negative, i. e., whether the last nontrivial cycle was add or sub- 
tract. 


3.2. Rules for Shifting 


The general rules for omitting cycles, which 
follow easily from the precepts given in section 2, 
are as follows. 

1. If an add sequence is initiated or maintained, 
i. e., if the cycle to be executed is an add cycle 
(or, in case it is trivial, if the last nontrivial one 
was an add cycle), the rule is: consecutive 0’s in 
number from one to n immediately to the left of 
the right-most multiplier digit are to be omitted. 

2. If a subtract sequence is initiated or main- 
tained, 1. e., if the cycle to be executed is a sub- 
tract cycle (or, in case it is trivial, if the last non- 
trivial one was a subtract cycle), the rule is: 
consecutive 1’s in number from one to n immediately 
to the left of the right-most multiplier digit are 
to be omitted. 

It should be clear that inasmuch as cycles are 
being executed continually during the multiplica- 
tion process, the current right-most multiplier 
digit is never omitted. | 

As the rules for shifting follow directly from the © 
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rules for omitting cycles, the rules for shifting 
have as their criteria: 

1. whether the current cycle initiates or main- 
tains an add sequence, and 

2. the number of consecutive 0’s or 1’s to the 
left of the current right-most multiplier digit. 
If an add sequence is initiated or maintained, the 
number of consecutive 0’s determines the shift; if 
a subtract sequence is initiated or maintained, 
the number of consecutive 1’s determines the shift. 
In any event, only the sign of the partial-product 
sum resulting from the last nontrivial cycle and 


the current right-most digits of the multiplicr are 
required as criteria for the rules, which are given 
in table 2.2. By comparing tables 2.1 and 2.2 it 
can be verified that the specific rules given in 
table 2.2 contain implicitly the conditions of ini- 
tiating or maintaining an add or a subtract 
sequence. 

As mentioned previously, it will be shown that 
there is little need for omitting more than three 
multiplier digits at any one time, and hence in 
the practical rules given, 7 is equal to 3. For n 
ereater than 3, the rules given in table 2.2 can be 
extended in a manner that should be fairly obvious. 


TABLE 2.2. Rules for shifting 


Action 


DShie 1-dieitt postions ac. oo a le 
Shift 2 digit positions_..........._.__.-.- 
Shift 3 digit positions.._._.._._...-_.____- 
Shift 4 digit positions.._________--_--_---- 


Required conditions ! 


10/+ or 01/-— 

101/+ or 010/— or 100/+ or O11/+ 
1001/+ or 0110/— or 1000/+ or 0111/+ 
0001/-+ or 1110/— or 0000/-£ or 1111/+ 


1 The digits shown are the current right-most digits of the multiplier. The sign indicates whether the partial-product sum for the last nontrivial cycle is 


positive or negative, i. e., whether the last nontrivial cycle was add or subtract. 


4. Effectiveness of Shortcut Multiplication 


The effectiveness of this method of multiplica- 
tion can be expressed in terms of percentages or 
ratios of the number of cycles actually required 
to the total number required without shortcutting. 
The smaller this ratio, the more effective is the 
method. 

For the sake of clarity in the examples to be 
given, certain symbols to be placed atop each 
multiplier digit are defined as follows: 


e=a multiplier digit for which nothing is added 
to or subtracted from the partial-product 
sum; a trivial cycle. 
+=a multiplier digit for which the multiplicand 
is added to the partial-product sum. 
— =a multiplier digit for which the multiplicand 
is subtracted from the partial-product sum. 
*=-a, multiplier digit corresponding to an omitted 
cycle. 


The lower limit of effectiveness can be deter- 
mined by finding repetitive sequences of multi- 
plier digits such that the least number of digits 
can be omitted, obeying the general rules given in 
the last section. Thus, it can easily be verified 
that for n=1 an example of sequences that result 
in omitting the least number of cycles is 


oe ene Se See oe ee eae 


. . . 000111000111000111000111, 


and that the number of cycles actually required is 
two-thirds of the number of digits. Likewise, it 
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can be seen that for n>1, examples of sequences 
that result in omitting the least number of cycles 
are 
ie bok ok bok ok bok ok bok ok bok — 
i . 00110011001100110011, 
an 


In these cases the number of cycles required is 
one-half of the number of digits. These sequences 
represent the worst possible conditions from the 
standpoint of speed. It is unlikely that they would 
occur very often in practice, although they do 
occur in the binary representation of certain deci- 
mal fractions, such as 4, 4, 75, d, ete. 

The upper limit of effectiveness is attained with 
strings of similar digits whose lengths are multiples 
of n+1. It is easy to show that the number of 
cycles actually required for this kind of sequence is 
1/(n+1) times the number of digits. For n=3, 
an example of such a sequence is 


ook ok OK °c oe ok KOK 8 Ok KKK 


ok 2k eK: 
. . 000000000000111111110000. 


In this case, which represents the best possible 
conditions from the standpoint of speed, the num- 
ber of cycles required is only one-fourth of the 
number of multiplier digits. There are, of course, 
many such “best”? sequences, and it is likely that 
this kind of sequence, at least for parts of words, 
would occur rather often in practice. 


From the practical standpoint, perhaps the most 
meaningful determination of effectiveness is the 
one for a purely random sequence of multiplier 
digits. A mathematical analysis! shows that the 
limiting value of the ratio of the number of cycles 
required (J) divided by the number of multiplier 
digits (m), as m increases without bound, is 


— (n+1) 
lim M Le aa 


M= oc Mm 2 1—27-@tD 


It is obvious that if n, the maximum number of 
cycles that may be omitted at once, also increases 
without bound, the limit of this ratio is one-third. 
For a practical value of m, say, 64, numerical cal- 
culation based on the results of a general mathe- 
matical treatment ? shows that the ratio is greater 
than the limiting value by only about 0.01. The 
results of such numerical calculations are sum- 
marized in figure 2.2. 

Figure 2.2 shows graphically for various values 
of n the ratio of the number of cycles actually 
required to compute a product to the number of 
digits in the multiplier. Besides the results for 
random sequences, there are also shown the re- 
sults for best sequences and worst sequences. Al- 
though the random-digits points are constructed 
from computed values for multipliers of a par- 
ticular length, namely, 64 binary digits, these 
results do not differ significantly from the results 
for somewhat shorter or longer words. 


Figure 2.2 shows that if the maximum number 
of cycles that can be omitted at any one time is 
three, the average time to compute a product, 
using a large number of random multipliers, is 
about 40 percent of the usual multiplication time, 
all other things being equal, or that the speed 
factor due to shortcutting is about 2.5. 

In practice, the multiplier digits are often not 
random, nor even apparently so. However, ex- 
perience seems to indicate that departures from 
randomness will usually favor the sequences of 
similar digits that require fewer cycles rather than 


those special sequences that require more cycles. 
As a result, the average number of cycles required 
for multiplications will generally turn out in 
practice to be less than the theoretical number for 
random sequences obtained from figure 2.2. If 
the curve were plotted for the average number of 
cycles required for multiplier sequences actually 
occurring in practice, consisting of a mixture of 
random or pseudo-random digits and strings of 
similar digits, it would be found that this curve 
would lie between the curve for random sequences 
and the curve for the best sequences. 

It is evident from an examination of figure 2.2 
that, so far as random sequences are concerned, 
the number of cycles required to compute a prod- 
uct decreases but little as nm is increased beyond 
three. This means that for random sequences 
there is little need for providing the means to 
omit more than three cycles at once. However, 
for nonrandom sequences such that the curve for 
the best sequences is applicable, it can be seen 
that making n=7 would halve the number of 
cycles required for n=3. Thus, for sequences 
consisting of a mixture of random digits and 
strings of similar digits, only the nonrandom por- 
tions would be significantly affected by in- 
creasing n. 
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Ficure 2.2. Effectiveness of shortcut mulitplication. 


5. Appendix 1. Speed for Infinite-Length Multiplier 


The average rate at which digits of an infinitely 
long multipher sequence are disposed of in digits 
per cycle is to be determined, under the assump- 
tions that the multiplier sequence is purely ran- 
dom, and that the add-subtract and shift rules 
stated in the body of the paper apply. This 
average rate is a fairly good approximation to the 
speed factor achieved for random sequences of 
practical length. 


1 See Appendix 1. 
2See Appendix 2. 


The probabilities of shifting by various amounts 
from a nonomitted digit will now be derived for 
specific conditions. ‘These probabilities follow 
directly from the shift rules stated in table 2.2. 
All the conditions, stipulated by the sign of the 
partial-product sum and the current nonomitted 
(right-most) digit of the sequence, result in only 
two sets of shift probabilities. These conditions 
are grouped according to the resulting shift prob- 
abilities and are designated as case 1 and case 2. 
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Case 1: 


The pertinent specific conditions are (a) partial- 
product sum not negative and current right-hand 
(nonomitted) digit 1, or (b) partial-product sum 
negative and current right-hand digit 0. 

Table 2.2 shows that under these conditions 
there can never be a shift of just one position. In 
order for there to be a shift of two positions, it is 
required that the table 2.2 states be 101/+, 011/+, 
100/—, or 010/—. The number of different states 
containing each of conditions (a) and (b), namely 
two, is just one-half of the four pertinent combina- 
tions ending in 1/+ and 0/—, respectively.’ 
Therefore, because random sequences are being 
considered, the probability of shifting two posi- 
tions from a nonomitted digit is $ for case 1 condi- 
tions. By similar reasoning it can be seen that 
the probability of shifting three positions is +. 
Then by extension, and as the rules are similar 
for a shift of n and for a shift of n-+1 positions, the 
following table (2.3) of shift probabilities for case 
1, designated by V/ can be compiled. It should be 
borne in mind that n has the same definition as in 
the body of the paper, i. e., n is the maximum 
number of digits omitted at once. The amount 
of shift ranges from 1 to n-+1. 


TABLE 2.3. Case 1 shift probabilities 
y) 1 2 8 a n n+1 
Vi 0 4 3 ea Se Q2-(n-1) Q-(n-1) 


Case 2: 


The pertinent specific conditions are (a) partial- 
product sum not negative and current right-hand 
digit 0, or (b) partial-product sum negative and 
current right-hand digit 1. 

Table 2.2 shows that in order for there to be a 
shift of one position, it is required that there be 
10/-++ or 01/—. But the number of these is just 
one-half of the pertinent combinations ending 
in either 0/+ or 1/—. Hence the probability of 
shifting one position is 4 for case 2 conditions. 
By similar reasoning it can be seen that the 
probability of shifting two positions is 4, and 
so on. Table 2.4, showing the case 2 shift proba- 
bilities, designated by W’, is constructed in the 
same manner as table 2.3. 


TaBLeE 2.4. Case 2 shift probabilities 
7 a co: a ae eae. n+] 
WwW 4 i 3} Oe ees age Oe ee 


3 For example, the four combinations ending in 1/+ are 001/+, 011/-++, 101/+-, 
and 111/+. 


18 


A fact regarding the combined effects of the 
add-substract rules of table 2.1 and the shift rules 
of table 2.2 that can easily be verified is that a 
shift of any amount less than n+1 always results 
in case 1 conditions, and that a shift of just n-+1 
positions results in case 1 conditions and case 2 
conditions with equal probability. This latter 
statement is true because, so far as the rules are 
concerned, the right-hand digit after a shift of 
n+1 positions is completely unknown prior to 
the shift, and it can be a 1 or 0 with equal proba- 
bility. It is also a fact, evident from an inspection 
of table 2.1, that the so-called trivial cycles, for 
which neither addition nor subtraction is done, 
are the very ones for which the case 2 conditions 
apply; conversely, cycles for which case 2 condi- 
tions apply are all trivial cycles. 

The following argument, which evaluates the 
probabilities of occurrence of case 1 and case 2 
conditions, is based on the assumption that the 
multiplier sequence is sufficiently long so that the 
probabilities for one cycle are identical to the 
probabilities for the preceding cycle. 

Let Q be defined as the probability that case 2 
conditions apply for any cycle. Then, clearly, 
1—Q is the probability that case 1 conditions 
apply. Since it is noted in the preceding para- 
graph that case 2 results from a shift of n+1 
positions with a probability of one-half, it follows 
that the probability of having had a shift of n+1 
positions is 2Q. 

Now, from tables 2.3 and 2.4, it can be seen 
that the probability of shifting n-+1 positions 
for case 1 is 2~@—)), and for case 2, 2~". Then 2Q 
can be evaluated by taking the weighted mean 
of the shift probabilities for the two cases. 


2Q= (1—Q)2-@-P + Q2-". (1) 
After some reduction, there is obtained 


Q-(n—-1) 


Q=aT9-W (2) 
and 
2—2-" F 
1—@=5 79-9 for n=1. (3) 


Equations (2) and (8) are the expressions for the 
probabilities of having case 2 and case 1 condi- 
tions, respectively, after an infinite number of 
cycles. 

The next step is to determine the average shift 
for case 1 and for case 2. This can be done from 
the information contained in tables 2.3 and 2.4. 
The average shift is simply the weighted mean of 
all the possible shifts for each case. Thus, if A is 
defined to be the average shift for case 1, we have 


A=D3 2-4 (WF YO, (4) 
i=2 


which by well-known methods is reduced to 
A=3—2-@-), forn>1. (5) 


Similarly, if B is defined to be the average shift for 
case 2, we have 


B=> j2-*- (n+41)2- (6) 


=2—2-", forn>0. (7) 

The average rate of shifting is the weighted 

mean of A and B, using for the weights 1—@ and 

Q, respectively. Thus the average rate of shift- 
ing 1s 

L=(1—Q)A+ QB, (8) 

which, through the use of eq (2), (3), (5), and (7), 


becomes 


{—27- (t+) 


L=35 50" 


for n>1. (9) 


The quantity Z is the average amount of shift 
per cycle, 1. e., the average number of digits dis- 
posed of in each cycle. The reciprocal of this is 
the average number of cycles required per digit. 
If the total number of cycles required to compute 
a product is M, and if the number of digits in the 
multiplier is m, then we have 
1 . Me 114+2-@¢*) 
Do nn 3 1a 

In table 2.5 are given the values of 1/Z for 
various values of n. For the sake of complete- 
ness, the value for n=0 (the usual multiplication 
method) is included, which is unity. 


forn>1. (10) 


TABLE 2.5. Relative multiplication times 


2 3 4 5 6 00 
3| 17) 11] 65 | 43 1 
7 | 45 | 31 | 189 | 127 | °°" 3 


6. Appendix 2. Speed for Finite-Length Multiplier 


The average number of cycles required to 
generate a product is to be determined under the 
assumptions that the multiplier is a purely random 
number of finite length and that the add-subtract 
rules and the shift rules stated in the body of this 
paper apply. . 

To anticipate the discussion, a quantity X; 1s 
defined to be the probability that the kth digit of 
a sequence is not omitted—i. e., the probability 
that one cycle will be expended in passing over 
the kth digit. Then the total number of cycles 
expended in passing over m digits of an m-digit 


m™ 
sequence is >)Xx. It is the purpose in this 
k=1 


appendix to derive an expression for this quantity 
in terms of known constants and parameters. 
To begin the discussion, the definitions of 
certain terms are given. 
R,=the probability of omitting the Ath 
digit. 
X;,= the = ropabiliey of not omitting the 
kth digit. 
P,=the probability that the kth digit cor- 
responds to an add or a subtract 


cycle. 

Q,—the probability that the kth digit cor- 
responds to a trivial cycle. 

Sj=the probability of shifting 7 places 
from the kth digit, provided the kth 
digit is not omitted. 

X,S{=the actual probability of shifting 7 
places from the kth digit. 

n=the maximum number of digits that 
may be omitted at once, and the 
amount of shift may range from 1 
to n-+1. 


Note that the superscript on S is not to be inter- 
preted as an exponent. 

From these definitions some rather obvious 
relations can be derived. 


Ay+R,=) (1) 
Pit Qi= Xi (2) 
n+1 
j= 


where n-+1 is the maximum amount of shift. 

If the digit & is not omitted, then during the 
previous cycle when the nonomitted digit was, 
say, k—p, there must have been a shift of exactly 
p positions. The probability that the kth digit 
is not omitted is therefore given by 


n+1 
Ans 25 Ae Shp. (4) 


In appendix 1 it is shown that under all cir- 
cumstances there are only two sets of probabilities 
for shifting various amounts, designated case 1 
and case 2. These shift probabilities are given in 
tables 2.3 and 2.4 and are labeled V’ and W’, 
respectively. For case 1, V? is the probability 
that a shift of 7 places will take place, and for 
case 2, W’ has a similar definition. Note that 
the superscripts on V and W are not to be inter- 
preted as exponents. 

It is also brought out in appendix 1 that the 
trivial cycles are the ones for which the case 2 
shift probabilities apply, and conversely. This 
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means then that the probability that case 2 
conditions apply for any digit k equals Q;, and 
that the probability that case 1 conditions apply 
equals P,. Therefore, the total probability of 
shifting 7 places from the kth digit is 


Now, the probability of occurrence of case 2 
conditions, 1. e., Q,, is one-half the probability 
that there was a shift of n+-1 positions during the 
previous cycle (noted in appendix 1): 


Qr= 3 Xn intrest (nD: (6) 


Then, through the use of eq (5) and (6) and the 
values of Vt! and Wt! from tables 2.3 and 2.4, 
there is obtained 


Oc=3[Qe- gn 2 +P rea 2 I), (7) 
which becomes 
OQ 2 "Psa re Vie (n+1)° (8) 


If the digit k is omitted during the cycle cor- 
responding to the nonomitted digit k—p, then 
there must be a shift of any amount greater than 
» during this cycle. For example, if the (k—4)th 
digit is the nonomitted digit, then the shift must 
be either 5,6,7, . . .,orn-+1, and in this instance 
the probability of omitting the kth digit is 


n+1 : 
i= 


If, for the omitted digit hk, all possible nonomitted 
digits to the right, k—p, are considered, the 
probability of omitting this Ath digit is the sum 
of such expressions for all possible p. 


n n-+-1 
R= 33] Xen 21 St-» | (9) 
p=1 j=p+1 
If use is made of 
n+l. 
2)5=1, (3) 


and if the term for p=1 is separated, eq (9) 
becomes 


nN nm+1 
Ry=Xy-1(1— Se-1) peat E= St.» | (10) 
p=2 _ J=p+i 
From eq (5) can be derived 


sii aoa i Ww, (11) 


from which is easily obtained 


m+1 Py n+1 O, % 
2) Si= 24 ee a aE We (12) 
j=D+ XE 2 =p+ Xi jp 
which by virtue of the character of V and W 
(see tables 2.3 and 2.4) is 


n+l P 0 
fo k py tk TW 
= S2, for 2<p<n. (14) 


Therefore, eq (10) becomes 
Ry=Xz10—Si, a) +2)Xi-oS = —Dp) | (15) 


and through the use of eq (4) there is obtained 
By=Xy-1— Xi-1 pit AK 
— Xy-1Sp-1 — Xe ney Seep: (16) 


After some reductions, using eq (5) and (6), eq 
(16) becomes 


Boy Xi— Qe + Xt-1— Via Qi, (17) 
which, by virtue of eq (2), is 
Ry=Pyt+Pr-1— Qe. (18) 
But because of 
X,+Bk,=1, (1) 
eq (18) is seen to be 
1—X;,=P,+Pr-i— Q:, (19) 


which after further reductions, using eq (2), 


becomes 
=9(1—Py,_1). (20) 


Equation (2), with the aid of eq (20) and (8) is 
now to be summed to yield a closed expression in 
terms of known constants and parameters. That 
is to say, it is desired to find an expression for 


3 X= Dy Prt 3s Oe 21) 


The summation of P; can be determined readily 
through the use of eq (20) if the initial condition 
is stipulated. The initial condition is that the 
first digit is never omitted; 1. e., X;=1. Now 
as the zero-th digit is nonexistent, Py is not defined. 


Therefore, eq (22) follows readily from eq (20) and 
(2). 
P 1=Qi=3. (22) 


If eq (20) is expanded until P, is reached, there 
is obtained 


Py= 27-2742 |, 
ane ene 2). 20) 
from which by well-known techniques is obtained 
P,=31—(—2)-, for k> 1. (24) 


The summation of P, is then 


S07 g ae) 


=1[3m+ 1 (—2)~”], for m>1. 


a 


(25) 


To obtain the summation of Q;, eq (8) will be 
expanded in a straightforward manner until the 
first Q, is reached for which zg<n-+l. 


Q=2" Pray re U2 Pree «4. 
Qe ETD! Pe pendtiy ye xs, «4 
$27 EDO” Pr scape FAY Oa sinten » 
(26) 


where s is the integral number of times k—1 is 
divisible by n+1. It is important to note that 
Q,=0 for 2<z2<n+1. This is true because Q, is 
one-half the probability of having shifted n+1 
positions during the previous cycle, which is clearly 
impossible for z<n+1. As the value of Q; is not 
0, (eq (22)), the expansion of Q, may have only 
Q, in the Q-term. The summation of Q; can now 
be written as 


m m—(n-+1) m—2(n+1) 
Oo er ee Eros 
k=1 k=1 k=1 

m—i (n+1) 
+9-@¢-) @t)) .Q7-%” 2 P,+ Mee 
m—t (n+1) 
+2-¢-D m+) 2g y P, 
k=1 
t 
Feige ee, (27) 
i=0Q 


where ¢ is the integral number of times m—1 is 


divisible by n-+1. This in turn can be written as 


m t m—i(n+1) t 
ps O.=2 pS J—i(nt+1) > Ps +9-1! > Q-i(ntl) 
k=1 i=l k=1 i=(Q 
(28) 


Substituting into this the expression for >} P,, 
eq (25), and using eq (21), there is obtained 


> i= $[8m+1—(—2) m+ aD aot 
XK [3m—31(n+ 1) +1—(—2)*@ th —m] 


t 
HSN Q-i@tD form>landn>0, (29) 
i=0 


mn 1 
2aki= 6 [3m+1—(—2)~™] 
fi 
9 Q- (n+1)__9- (t+) (a+) 
+9 (3m+1) ir Gey 
y) DEE isa) erty) 
3 (n+1) a 
— (t+1) (n+) 


+5 (n-+ ie [9 Fi 


1—2-@+D (n+1) 


2 (1—2-@FD)’ 
(30) 


5 Q-m > (—1)m-tty 
i=1 


where m>1, n>0, and where ¢ is the integral 
number of times m—1 is divisible by n-+-1. 


The formula for the number of cycles required 
to generate a product is complete except for the 
terminal effect. An extra cycle is needed at the 
end to terminate if the last nontrivial cycle was a 
subtract and if the last digit is a 1. According 
to the rules, a subtract cycle is imitiated by 11, 
and it is terminated by 00. The probability that 
the last nontrivial cycle is a subtract, to be called 
Z, is the probability that there is 11 somewhere 
in the sequence that is not followed by 00. Some 
examples follow, in which only the left-hand 
portions of the sequences are shown. 


(a) 11. . 

(b) 1011. 

(c) 101011. 
(d) 10101011. 
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It should be noted that these sequences are 
mutually exclusive. The probability of occurrence 
of (a) is 2~*; of (b), 274; of (c), 27°; of (d) 27%; ete. 
If the word is m digits long, the last sequence of 
this kind has a probability of occurrence 2~” if m 
is even, or 2~“-) if m is odd. This latter fact 
results from the requirement that all the sequences 
be mutually exclusive. Now Z is the sum of all 
these probabilities. 


3m 
LZ 7 if m 1s even, or (31) 
k=i 


3(m—1) 
Z= >. 27% 


k=] 


if m is odd. (32) 


These reduce to 
Z=i(1—2-”") if miseven, or (83) 
Z=1(1-2'-") ifmisodd. _—(34) 


The total number of cycles required, to be 
called M, is therefore 


M= DXtZ (35) 


Figure 2.2 contains a plot of the values of MZ/m 
for m=64 and for various values of n. A discus- 
sion of figure 2.2 is in the body of the paper. 


U.S. GOVERNMENT PRINTING OFFICE: 1960 


THE NATIONAL BUREAU OF STANDARDS 


The scope of activities of the National Bureau of Standards at its headquarters 1 in Washington, 
D. C., and its major field laboratories in Boulder, Colorado, is suggested in the following listing 
of the divisions and sections engaged in technical work. In general, each section carries out 
specialized research, development, and engineering in the field indicated by its title. A brief 
description of the activities, and of the resultant publications, appears on the inside front cover. 


WASHINGTON, D. C. 


Electricity and Electronics. Resistance and Reactance. Electron Devices. Electrical In- 
struments. Magnetic Measurements. Dielectrics. Engineering Electronics. Electronic In- 
strumentation. Electrochemistry. 

Optics and Metrology. Photometry and Colorimetry. Optical Instruments. Photographic 
Technology. Length. Engineering Metrology. 

Heat. Temperature Physics. Thermodynamics. Cryogenic. Physics. Rheology. Engine 
Fuels. Free Radicals Research. 

Atomic and Radiation Physics. Spectroscopy. Radiometry. Mass Spectrometry. Solid 
State Physics. Electron Physics. Atomic Physics. Neutron Physics. Nuclear Physics, 
Radioactivity. X-rays. Betatron. Nucleonic Instrumentation. Radiological Equipment. 
AEC Radiation Instruments. 

Chemistry. Organic Coatings. Surface Chemistry. Organic Chemistry. Analytical Chem- 
istry. Inorganic Chemistry. Electrodeposition. Molecular Structure and Properties of Gases. 
Physical Chemistry. Thermochemistry. Spectrochemistry. Pure Substances. 

Mechanics. Sound. Mechanical Instruments. Fluid Mechanics. Engineering Mechanics. 
Mass and Scale. Capacity, Density, and Fluid Meters. Combustion Controls. 

Organic and Fibrous Materials. Rubber. Textiles. Paper. Leather. Testing and Speci- 
fications. Polymer Structure. Plastics. Dental Research. 

Metallurgy. Thermal Metallurgy. Chemical Metallurgy. Mechanical Metallurgy. Corro- 
sion. Metal Physics. 

Mineral Products. Engineering Ceramics. Glass. Refractories. Enameled Metals. Con- 
creting Materials. Constitution and Microstructure. 

Building Technology. Structural Engineering. Fire Protection. Air Conditioning, Heat- 
ing, and Refrigeration. Hor, Roof, and Wall Coverings. Codes and Specifications. Heat 
Transfer. 

Applied Mathematics. Nurnerical Analysis. Computation. Statistical Engineering. Math- 
ematical Physics. : 

Data Processing Systems. SEAC Engineering Group. Components and Techniques. Digital 
Circuitry. Digital Systems. Analog Systems. Application Engineering. 

@ Office of Basic Instrumentation @ Office of Weights and Measures 


BOULDER, COLORADO 


Cryogenic Engineering. Cryogenic Equipment. Cryogenic Processes. Properties of Ma- 
terials, Gas Liquefaction. 

Radio Propagation Physics. Upper Atmosphere Research. JIonospheric Research. Regular 
Propagation Services. Sun-Earth Relationships. 

Radio Propagation Engineering. Data Reduction Instrumentation. Modulation Systems 
Navigation Systems. Radio Noise. Tropospheric Measurements. Tropospheric Analysis 
Radio Systems Application Engineering. 

- Radio Standards. High Frequency Electrical Standards. Radio Broadcast Service. High 
Frequency Impedance Standards. Calibration Center. Microwave Physics. Microwave 
Circuit Standards. 


